RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: jojo^j^jj 

Source: O^t 

Date Processed by STIC: 314102, 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTTN 3.0 e-mail help: patin3help(g),uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20 231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the followi ng to t he indicat ed addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Revised 01/29/2002 



t 



Raw Sequence Listing Error Summary 



ERROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nucleics 

Wrapped Aminos 

2 Invalid Line Length 

3 J Misaligned Amino 
Numbering 

4 Non-ASCII 

5 Variable Length 



6 Patentln 2.0 

"bug" 



7 Skipped Sequences 

(OLD RULES) 



The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping," 

The rules require that a line not exceed 72 characters in length. This includes white spaces. 

The numbering under each 5* amino acid is misaligned Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequences) contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION.SEQ ID NO:X: (insert SEQ ID NO where "X"is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



8 Skipped Sequences 

(NEW RULES) 



Use of n*s or Xaa*s 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



10 



11 



12 



(NEW RULES) 



Jnvalid <213> 
Response 



Use of <220> 



Patentln 2.0 
"bug" 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

j* /■ 

Per 1.823 of Sequence I&les, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 1 



Sequence(s) _ 



_ missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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Do©s Not Comply 0IPE 

©©riveted Diskette Needed 



RAW SEQUENCE LISTING DATE: 09/04/2002 

PATENT APPLICATION: US/10/029, 988 TIME: 16:12:39 



- Input Set : A:\Sequence Listing.txt 
. .. Output Set: N:\CRF4\09042002\J029988.raw jj)^^^^ 

3~<110> APPLICANT: Bio-Technology General Corp <\ Z07 -dv^h^H-O? TW^SiVj 

ERRORED SEQUENCES VlA^t 

3878 <210> SEQ ID NO: 204 ^ 
- 3879 <211> LENGTH: 266 B^^Ke^, 

3880 <212> TYPE: PRT 

3881 <213> ORGANISM: Homo sapiens 
— 3883 <400> SEQUENCE: 204 

3885 Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 

3886 1 5 10 15 

3888 Ala Gin Pro Ala Met Ala Glu Val Gin Leu Val Glu Ser Gly Gly Gly 

3889 20 25 30 

~ 3891 Val Val Arg Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly 
3892 35 40 ' 45 

3894 Phe Thr Phe Asp Asp Tyr Gly Met Ser Trp Val Arg Gin Ala Pro Gly 

sbl 3895 50 55 60 

3897 Lys Gly Leu Glu Trp Val Ser Gly He Asn Trp Asn Gly Gly Ser Thr 

3898 65 70 '75 80 

3900 Gly Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn 

3901 85 90 95 

3903 A i a xys Asn Ser Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

3904 U 100 105 HO 

3906 Thr Ala Val Tyr Tyr Cys Ala Arg Met Arg Ala Pro Val He Trp Gly 

3907 115 120 125 

3909 Gin Gly Thr Leu Val Thr Val Ser Arg Gly Gly Gly Gly Ser Gly Gly 

3910 130 135 140 

3912 Gly Gly Ser Gly Gly Gly Gly Ser Ser Glu Leu Thr Gin Asp Pro Ala 

3913 145 " ~ 150 155 160 
3915 Val Ser Val Ala Leu Gly Gin Thr Val Arg He Thr Cys Gin Gly Asp 

~~ - 3916 165 170 175 

3918 Ser Leu Arg Ser Tyr Tyr Ala Ser Trp Tyr Gin Gin Lys Pro Gly Gin 

3919 ' ' 180 185 190 

3921 Ala Pro Val Leu Val He Tyr Gly Lys Asn Asn Arg Pro Ser Gly He 

3922 195 200 205 

3924 Pro Asp Arg Phe Ser Gly Ser Ser Ser Gly Asn Thr Ala Ser Leu Thr 

3925 210 " 215 220 

3927 He Thr Gly Ala Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Asn Ser 

3928 225 ' 230 235 240 

3930 Arg Asp Ser Ser Gly Asn His Val Val Phe Gly Gly Gly Thr Lys Leu 

3931 " 245 250 255 
3933 Thr Val Leu Gly Ala Ala Ala Lys Ala Lys 



<file://C:\CRF^Outhold\VsrJ029988.htm 
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RAW SEQUENCE LISTING 
"PATENT APPLICATION: US/10/029 , 988 



DATE: 09/04/2002 
TIME: 16:12:39 




Input Set : A:\Sequence Listing.txt 

Output Set: N:\CRF4\09042002\J029988.raw . 



9/4/02 



DATE: 09/04/2002 
TIME: 16:12:40 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF4\09042002\J029988.raw 



L: 
L: 
L: 
I*:, 
L: 
L: 
L: 
L: 
L: 
L : 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L : 

±* 
L: 
L: 
L: 
h : 
L: 
L : 
L: 

„L: 
L : 
L: 
L : 
L : 
L: 
L: 
L: 

-L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 



0 M:282 W: Numeric Field Identifier Missing, 
0 M:282 W: Numeric Field Identifier Missing, 



<120> is required. 
<160> is required. 



21 M:283 
32 M:283 
43 M:283 
54 M:283 
65 M:283 
76 M:283 
105 M:283 
116 M:283 
127 M:283 
M 
M 
M 
M 



283 
283 
283 
283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 W 



W: 
W: 
W: 
W: 
W: 
W: 
W 
W 

w 
w 
w 
w 
w 
w 
w 
w 
w 
w 



138 
149 
172 
183 
194 
205 
216 
227 
250 
261 
272 M:283 
283 M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
645 M-283 
674 M:283 
M:257 
M:283 
M;341 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 



294 
356 
451 
504 
515 
556 



705 
708 
716 
737 
766 
795 
824 
853 
912 
1030 
1056 
1085 
1114 
1143 
1199 
3453 
3464 



Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
-Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
; *Missing Blank Line separator 
- Missing Blank Line separator 
"Missing Blank Line separator 
Missing' Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Feature value mis-spelled or 
Missing Blank Line separator 



(46) 



or "Xaa H used, for 



M:283 
M:283 
M:283 
M:283 
M:283 
M:283 W: 
M:283 W: 



W: 
W: 
W: 
W: 
W: 
W: 



Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 



<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
invalid, <221> Name/Key for SEQ ID# 
<400> field identifier 
SEQ ID# : 36 after pos.:32 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 



~4 
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L : 
L: 
L: 
L: 
L: 
L: 
L: 
M : 
. L: 
L: 
L; 
L: 



3475 M:283 W 

3486 M:283 W 

3497 M:283 W 

3508 M:283 W 

3519 M:283 W 



3530 M:283 W. 
3934 M:332 E 
332 Repeated 
3942 M:333 E: 
3942 M:330 E: 
3942 M:252 E: 
0 M:203 E: No 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/029 , 988 



DATE: 09/04/2002 
TIME: 16:12:40 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF4\09042002\J029988.raw 



<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 



: Missing Blank Line separator, 
: Missing Blank Line separator, 
: Missing Blank Line separator, 
: Missing Blank Line separator, 
: Missing Blank Line separator, 
; Missing Blank Line separator, 

: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 204 
in SeqNo=204 

: Wrong sequence grouping, Amino acids not in groups 1 
: (2) Invalid Amino Acid Designator, NUMBER OF INVALID KEYS:1 
: No. of Seq. differs, <211> LENGTH: Input: 266 Found: 267 SEQ: 204 

of Seq. differs, <160> Number Of Sequences : Input (0) Counted (204) 



A ■ 



v., a 



file://C:\CRF4\Outhold\VsrJ029988.htm 
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<uo> 

"<120> 

<no> 



<1S0> 

<isi> 



Smith, John; Sml thgcnc Inc. 
Example of a Sequence Listing 
01-J00001 



PCT/EP98/00001 
1998-12-31 



US 08/999.999 
1997-10-1S 



<IC0> 



c 1,0> 



<210> 

< 2 1 1 > 

< 2 1 2 > 

< 2 1 3 > 



<220> 

< 2 2 I > 

< 2 2 2 > 

<300> 
<301> 

< 302 > 

<303 > 

< 304 > 
<305> 
<306> 

< 307 > 
<)08> 

< 309> 



Patcntln version 2.0 
1 

389 

ON A A 
Paramecium sp. 



COS 
(279) 



(,389) 



Doe. Richard 

Isolation and Characterization o( 

Protease from Paramecium sp . 

Journal ot Genes 

1 

4 

1-7 

1988-06-31 

1234S6 

198D-OC-31 



1 



o Ccnc Mncoding a 



<<00> 

agccgiagtc aitcctgtgr cctcttctct ctgggciici caccctgcta atcagatctc 

• '■ 

agggagagtg tcttgaccct; cctctgcctt tgeagcttea caggcaggod ggcaggcagc 

tgatgtggca attgetggea gtgecacagg cttttcagcc aggcttaggg tgggttccgc 

cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 



60 
120 
100 
240 



Annex VII. page 30 .. 

Appendix 3, page 2 . y 

„„.,.., at© Ott tea a.tg ttc age 29c 

©gacctgatt aggtgagcag gaggaggggg cagttagc ^ V4 i S cr Met Phe Scr 

r rm ttc " gtt tgt ttg ttc caa 3<< 

US E 3 ,iS * * ^ r *° c ' s » ■** cl " 

tea ctg cag ccg . ctt m 309 

cgt ccc aa* 0" etc ccc t g t cac tea cln pro ASIU 

cys Pro Lys rV*l LCu Pro Cys ius Scr , 35 



<2i0> 2 a- w. < 

<2U> 3? 

<212> PRT .-V 

<213> rar*mccium sp. ' ^ 

<<00> 2 * Lys rrp pro Cly P\K Cys Leu 

Met V«il Scr net Phc Scr Leo Scr Pl»c uy* 1, ^ 

1 ^ 

„ hc Cys Leu .,,c On Cy5 -.0 ..ys va» .-u C y£ Hi. Scr 

20 " 



Scr - 



Leu Cln Pro Asn Lou 



<2i0> 3 

<21 1> 1 1 .... 

<212> 

< 2 1 3 > Artificial Sequence 



ZIV* Ocsigned Pcnuac ,oscd on sue and polaricy <• "^..^ 
linker between the aipU* and beta cnoin* 

M« 0> Vl Asn Leu Clu Pro Mcc His Thr Clu He 

1 5 10 

<210> < 

<400> « 
000 



(Annex VIII follows! 



cable. The numeric idcBpficr shall be used only in che^^cquence 

Listing." The order ind presentation of the items of inrWmation in the 
"Sequence Listing" shall conform to the arrangement given below. Each 
item of information shall begin on a new line and shall* begin with\the # . 
numeric identifier enclosed in* angle brack ts as shown. % The submission* 
of those items of information designated with an "M" is mandatory. The 
subdiission of those items of Information designated with an "0" is 
optional. Numeric identifiers <110> through <170> shall only,, be s t 
forth at the beginning of the "Sequence Listing." T£e following table 
illustrates the numeric identifiers. 



Numeric 
Identifier 



Definition 



Comments and 
format 



Mandatory (MJ or 
Optional jfO) 



<110> 



Applicant 



<120> 



<130> 



<1<0> 



Title of 
Invention 

file Reference 



Current Applica- 
tion Number 



Preferably max. 
of 10 names; 
one name per line; 
preferable .format: 
Surname, Others 
Names and/or 
Initials 



Personal file 
re fcrcncc 



Specify as: 

US 07/999,999 or 

PCT/US96/99999 



M v 



H. when filed prior 
to assignment of 
appl . numbc r 

M, if available 



<141> 



Current Filing 
Date 



SpcciCy as: yyyy-mm-dd M, if available 



<150> 



Prior Application Specify as: 
Number us 07/999,999 or 

PCT/US96/99999 



M, if applicable 
include priority 
documents under 
35 USC 119 and 
120 



<1S1> 



Prior Application Specify as: yyyy-mm-dd M, if applicable 
filing Date 



<1G0> 



Number of SEQ 10 
NOs 



Count includes 
total number of 
SEQ 10 NOs 



<170> 



Software , ' Name of software used O 

to create the 
Sequence Listing 



<210> 



SEQ ID NO: If : 



Response shall be an 
integer repre- 
senting the SEQ 
ID NO shown 



M 



<211> 



Length 



Respond with an integer M 
expressing the number 
of bases or amino acid 
residues 



<2 \2> 



Type 



<213> 



Organism 



<220> 



feature 



Whether presented 
sequence mole- 
cule is ONA t 
RNA, or rRT 
(protein) • I f 
„ a nucleotide 
sequence con- 
tains both DMA 
and UNA frag- 
ments, the 
type shall be 
-DNA. M In ad- 
dition, the 
combined DMA/ 
RNA molecule 
shall be further 
described in 
the <220> to 
<223> feature 
section. 



M 



Scientific namc # 
i.e. Genus/species , V 
Unknown or Artifi- 
cial Sequence. In 
addition, the ' V 

"Unknown*' or 
"Artificial Se- 
quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 

Leave blank after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence. 



M 



M, under the 
following condi- 
tions : i f "n, " 
"Xaa, " or a mod- 
i f ied or unusual 
L-amino acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Artifi- 
cial Sequence" or 
"Unknown" ; i f 
molecule is 
combined DNA/RNA. 



< 2 2 1 > 



Namc/Kc y 



<222> 



Locat ion 



Provide appropriate 
identi f icr for 
feature, pre- 
ferably from 
WIPO Standard 
ST. 2b (1998). 
Appendix 2, 
Tables 5 and 6 

Specify location 
within sequence; 
where appropriate 
state number of 
first and last 
bases/ammo «*cids 



M, under the fol- 
lowing conditions 
if "n, " "Xaa , " or 
a mod i f i cd or un- 
usual L-amino 
acid or modified 
base was used in 
a sequence 

M, under the fol- 
lowing conditions 
i f M n, " "Xaa. " or 
a modi f ied or un- 
usual L-amino 
acid or mod i f i cd 



<223> 



Other Infor- 
mation 



in feature 



Other relevant 

information; 

four lines maximum 



was u*{|d 
:quence 

fi f under the fol- 
lowing; conditions: 
jlf M n, M H Xaa," or 
% a modified or un- 
usual L-amino acid 

r ; modified base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence" or . 
••Unknown"; i-f-=* : * 
moi.cculc is com- 
bincd DNA/RNA^ s 



<300> 



<301> 



Publication 
Information 

Authors 



Leave blank" 
after <300> * m 

Preferably max 
of ten named 
authors of publi- 
cation; specify 
one name per line; 
preferable format: 
Surname, Other 
llamcs and/or 
Initials 



O 



<302> 
<303> 
<304 > 
<30S> 
<30G> 
< 307 > 



< 300> 



<309> 



<310> 



Title 

Journa 1 

Volume 

Issue 

Pages 

Date 



Database 
Acccs s i on 
Number 



Database Entry 
Date 



Patent Document 
Numbe r 



Journal date on which 
data published; 
specify as yyyy-mm- 

dd, MMM-yyyy or 

Scason-yyyy 

Accession number 
assigned by dsn- 
base including 
database name 

Date of cnt ry in 
database; specify 
as y y y y - mm- dd or 
MMM-yyyy 

Document number; 
(or patent-type 
citations only. 
Spcci fy as, for 
example, US 
07/999. 999 



O 
O 
O 
O 



.i m 



|/? Q/0«J I S * I'M 



O 1 1> 



<312> 



<313> 



<<00> 



Patent Til 
Date 



Publication Date 



Relevant 
Residues 



Sequence 



Document 

filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 

document publication 
date, for 
patent-type 
citations only; 
specify as yyyy-mm-dd; 

FROM (position) TO 
(position) 

SEQ ID NO should 
follow the 
numeric identifier 
and should appear 
on the lincprc-^ 
ceding the actual* 
sequence 



S. Section 1.02< is revised to read as follows: 

1.02< Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 



(a) The computer readable form required by 
following speci fi cat ions: 



1 .021 (c) shall meet the 



(1) The computer readable form shall contain a single ."Sequence Listing" 
as cither a diskette, series of diskettes, or other permissible media 
outlined in paragraph (cj of this section. 

(?) The "Sequence Listing", in paragraph (a) (U of this section shall be 
submitted in American Standard Code for Information Interchange _ (ASCI I ) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a sc 1 f -c>: t ract i ng format that will decompress 
on one of the systems described in paragraph (b) of this section. 



(S) Page numbering shall not appear within the computer 
version of the "Sequence Listing" file. 



readable form 



(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a 
reference number, and an application serial number. and filing date, if 
known . 

(b) Computer readable form submissions must meet these format 
rcqui remcnt s : 

(11 Computer: I DM PC/XT/AT. or compatibles, ot Apple Macintosh; 



(21 Operating System: MS-DOS. Unix or Macintosh; 



